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ABSTRACT:  Using  dynamic  combinatorial  chemistry,  mix¬ 
tures  of  dipeptide  monomers  were  combined  to  probe  how  the 
structural  elements  of  a  family  of  self-assembled  [2] -catenanes 
affect  their  equilibrium  stability  versus  competing  non- 
catenated  structures.  Of  particular  interest  were  experiments 
to  target  the  effects  of  CH— n  interactions,  inter-ring  hydrogen 
bonds,  and  /2-turn  types  on  [2]-catenane  energetics.  The  non¬ 
variant  core  of  the  [2]-catenane  was  shown  only  to  adopt  type 
IP  and  type  VTII  turns  at  the  /2-2  and  /2-4  positions, 
respectively.  Monomers  were  designed  to  delineate  how 

these  factors  contribute  to  [2]-catenane  equilibrium  speciation/stability.  Dipeptide  turn  adaptation  studies,  including  three- 
component  dynamic  self-assembly  experiments,  suggested  that  stability  losses  are  localized  to  the  mutated  sites,  and  that  the  turn 
types  for  the  core  /2- 2  and  /2-4  positions,  type  IT  and  type  VTII,  respectively,  cannot  be  modified.  Mutagenesis  studies  on  the  core 
Aib  residue  involved  in  a  seemingly  key  CH— n—  CH  sandwich  reported  on  how  CH— n  interactions  and  inter-ring  hydrogen 
bonds  affect  stability.  The  interacting  methyl  group  of  Aib  could  be  replaced  with  a  range  of  alkyl  and  aryl  substituents  with 
monotonic  affects  on  stability,  though  polar  heteroatoms  were  disproportionately  destabilizing.  The  importance  of  a  key  cross¬ 
ring  H-bond  was  also  probed  by  examining  an  Aib  for  L-Pro  variant.  Inductive  affects  and  the  effect  of  CH  donor  multiplicity  on 
the  core  proline— n  interaction  also  demonstrated  that  electronegative  substituents  and  the  number  of  CH  donors  can  enhance 
the  effectiveness  of  a  CH— n  interaction.  These  data  were  interpreted  using  a  cooperative  binding  model  wherein  multiple  non- 
covalent  interactions  create  a  web  of  interdependent  interactions.  In  some  cases,  changes  to  a  component  of  the  web  lead  to 
compensating  effects  in  the  linked  interactions,  while  in  others,  the  perturbations  create  a  cascade  of  destabilizing  interactions 
that  lead  to  disproportionate  losses  in  stability. 


■  INTRODUCTION 

Multivalency  and  cooperative  binding  are  phenomena  that 
occur  throughout  the  genre  of  molecular  recognition,  though 
the  latter  is  principally  observed  in  the  biological  context.1 
While  many  molecular  systems  participate  in  multivalency,  true 
cooperativity,  i.e.,  binding  strengths  that  are  greater  than  the 
sum  of  the  parts,  is  rare  for  synthetic  systems.2  One  contributor 
to  the  chasm  between  synthetic  and  biological  systems  is  the 
inherently  small  size  and  the  lack  of  complexity  of  synthetic 
systems  that  tend  to  be  amenable  to  study  by  traditional 
spectroscopic  and/or  thermochemical  means.  The  process  of 
“designing”  a  molecular  recognition  system  additionally  creates 
a  bias  toward  the  molecular  interactions  that  were  used  to 
create  the  structure's  desired  recognition  properties.  This  heavy 
hand  overwhelms  the  subtle  factors  that  could  conspire  to 
create  cooperativity  and  additionally  distances  the  results  from 
the  biological  context  where  they  are  most  often  observed.  The 
study  of  multivalency  and  cooperative  binding  is  therefore 
challenged  by  the  constraints  of  being  “designed”  and  the 
technical  difficulties  of  studying/manipulating  complex  (large) 
systems  known  to  display  cooperativity.  The  generation  of 
spontaneously  assembled  systems  held  together  with  multiple, 


easily  varied  interactions  creates  a  new  tool  for  the  study  of 
multivalency  and  cooperative  binding  in  complex  systems. 

In  our  previous  paper,3  we  described  a  phenomenon  wherein 
mixtures  of  dipeptides  self-assemble  under  dynamic  conditions 
into  stereochemically  and  constitutionally  precise  [2] -cate¬ 
nanes,  wherein  one  56-membered  macrocyclic  ring  interlocks 
with  a  second  ring  to  give  highly  ordered  structures  that  are 
intermediate  in  size  and  complexity  between  typical  molecular 
recognition  and  biological  systems.  Despite  their  large  size  (>3 
kDa),  these  compounds  could  be  precisely  characterized  by 
high-resolution  techniques  that  include  X-ray  crystallography, 
NMR,  and  mass  spectrometry.  These  tools  revealed  that  the 
[2] -catenanes  are  assembled  from  a  complex  web  of  molecular 
interactions  that  include  both  intra-  and  inter-macrocycle 
hydrogen  bonds,  a  broad  array  of  edge-to-face  and  slipped  face- 
to-face  n—n  interactions,  and  seemingly  critical  CH— n 
interactions.  Other  defining  structural  features  include  the 
utilization  of  /2-turns,  which,  at  least  in  the  core,  are  highly 
conserved  and  adopt  biomimetic  turn  types  (/2-2  is  type  IT  and 
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fi-4  is  type  VIII).  Each  of  these  non-covalent  interactions  and 
the  /Tturn  secondary  structures  are  common  elements  in 
protein  structures. 

In  the  [2]-catenanes  characterized  in  our  previous  paper,3 
one  invariant  region  of  the  structure  is  the  “core”,  where  the 
two  conserved  aryl  glycine  units  seed  a  network  of  CH —n 
interactions  and  inter-macrocycle  hydrogen  bonds.  The 
absolute  requirement  for  an  aryl  glycine-containing  monomer 
in  the  dynamic  self-assembly  reaction  (DSA),  and  the 
observation  that  each  arylGly  unit  engages  in  CH— n 
interactions  on  both  n— faces,  led  to  the  hypothesis  that  these 
interactions,  expressed  through  the  CH— n—  CH  sandwich,  were 
acting  to  template  the  synthesis  and  stabilize  the  catenane. 

Relatively  weak  CH— n  interactions  are  not  well  studied, 
despite  their  ubiquity  and  influence  over  important  chemical 
phenomena,  which  include  small-molecule  conformation,4 
chiral  discrimination,5  assembly  of  host— guest  complexes, 4a’6 
molecular  recognition,7  selectivity  of  organic  reactions, 6:1,8 
crystal  packing,  a’9  bio  conjugates,4  and  protein— protein  inter¬ 
actions  involving  polyproline  helices.10  The  well- characterized 
nature  of  the  catenane  and  the  modular,  thermodynamically 
controlled  self-assembly/ synthesis  protocol  suggested  an 
experimental  strategy  for  probing  the  importance  and  the 
properties  of  the  CH— n  interactions  present  in  the  self- 
assembled  [2]-catenanes.  The  desire  to  examine  the  CH— n 
interaction  in  a  complex  setting  was  stimulated  by  the 
constraints  of  small-molecule  studies,  which  can  measure  the 
strength  of  the  CH— n  interaction  in  isolation,  and  computa¬ 
tional  studies,  which  can  probe  the  CH— n  interaction  in  simple 
systems.  Studies  capable  of  manipulating  this  interaction  in  a 
complex,  multivalent  scheme  that  bridge  the  gap  to  the 
biological  context  are  scant. 

With  the  goal  of  delineating  a  structure/ stability  relationship, 
we  report  the  outcome  of  our  investigations  to  manipulate  the 
[2]-catenane's  defining  CH— n  interactions  and  subsequently  its 
core  /Tturns.  The  previous  paper3  provided  a  structural 
blueprint  for  designing  modified  monomers  to  perturb  the 
strength  of  the  CH— n  interaction,  as  determined  from  the 
relative  stability  of  the  folded  [2] -catenane  versus  competing 
non-catenated  species.  The  nature  of  these  experiments 
required  careful  controls  to  support  the  notion  that  the 
examined  subtle  perturbations  were  principally  affecting  the 
highly  structured  and  folded  catenanes  rather  than  the  non- 
catenated  cyclic  oligomers. 

These  mutation  studies  indicate  that  a  minimum  number  of 
CH— n  interactions  are  necessary,  but  not  sufficient  for  the  self- 
assembly  of  these  catenanes.  Mutations  in  the  CH— n 
participating  core  /Tturns  (/?-2  and  /?-4)  revealed  that  the 
conserved  turn  structures  were  critical  to  the  stability  of  the 
[2] -catenanes.  Structure— stability  correlations  showed  the 
CH— n  interactions  to  be  part  of  an  interconnected  network 
of  non-covalent  interactions  that  encompass  the  entire 
structure.  Semi- quantitative  measurements  of  stability  indicate 
that  even  small  structural  perturbations  can  become  magnified 
or  attenuated  in  this  network.  For  example,  the  excision  of  two 
CH3  groups  involved  in  CH— ^  interactions  causes  a  >  1000-fold 
drop  in  catenane  stability.  As  will  be  discussed,  the  source  of 
this  loss  is  a  combination  of  CH— n  loss,  a  change  in  /Tturn 
conformational  preference,  and  a  concomitant  loss  of  highly 
conserved  inter-ring  hydrogen  bonds.  In  other  mutations,  the 
expected  drop  in  stability  was  attenuated,  and  metrical  evidence 
points  to  compensating  structural  reorganizations  which  restore 
some  of  the  lost  stability.  These  experiments  thus  enable  one  to 


examine  multivalency  and  perhaps  even  cooperativity  in  a 
regime  of  significant  structural  complexity.  As  such,  this  system 
enables  one  to  begin  revealing  how  the  nature  of  molecular 
interactions  can  be  modulated  by  other  structural  elements. 

■  EXPERIMENTAL  SECTION 

General  Methods.  Chemicals  were  purchased  from  Aldrich,  Fisher 
Scientific,  and  Chem-Impex  International,  Inc.  and  used  as  received. 
XH  and  13C  NMR  spectra  of  monomers  were  recorded  on  a  Bruker 
DRX  500  spectrometer  and  processed  using  TOPSPIN  Bruker  NMR 
software  (V.  3.0).  High-resolution  mass  spectra  were  obtained  on  an 
Agilent  Accurate  LC-TOF  mass  spectrometer  (Agilent  Series  6220) 
operating  in  positive-ion  mode  with  an  electrospray  ionization  source 
(fragmentor  =  175  V).  The  data  were  analyzed  using  Agilent 
MassHunter  Workstation  Software,  Qualitative  Analysis  (V. 
B.02.00).  HPLC  analysis  was  performed  on  a  Hewlett-Packard  Series 
1100  instrument,  using  a  Halo-C18  column  (4.6  X  150  mm,  2.7  //m) 
with  gradient  elution  (methanol/water)  at  a  flow  rate  of  0.45  mL/min 
and  at  55  °C.  The  injection  volume  for  a  5  mM  dynamic  self-assembly 
(DSA)  reaction  was  typically  3.5  //L.  UV  absorbance  chromatograms 
were  recorded  at  wavelengths  of  220  and  289  nm.  The  data  were 
analyzed  using  the  Agilent  Chemstation  software.  Accurate  Mass  LC- 
TOF  analyses  of  DSA  reactions  were  performed  on  an  Agilent  Series 
1200  LC  instrument,  using  a  Halo-C18  column  (2.1  X  50  mm,  2.7 
// m)  with  gradient  elution  (methanol/water  containing  0.1%  formic 
acid)  at  a  flow  rate  of  0.5  mL/min  and  at  50  °C.  The  eluent  was 
analyzed  by  an  Accurate  Mass  LC-TOF  mass  spectrometer  (Agilent 
Series  6220)  in  positive-ion  mode  with  an  electrospray  ionization 
source  (fragmentor  =  375  V).  The  data  were  analyzed  using  Agilent 
MassHunter  Workstation  Software,  Qualitative  Analysis  (V.  B.02.00). 

Self-Assembly  of  [2]-Catenanes  for  HPLC  and  LC-TOF 
Analyses.  Five  millimolar  DSA  solutions  were  prepared  on  a  1  mL 
scale,  d-  or  d,l-1x  (5  //mol)  and  l,l-72  or  L,L-y2ph  (5  //mol)  were 
separately  dissolved  in  1  mL  of  a  mixture  of  MeCN  and  CHC13  (1:3), 
and  0.5  mL  of  the  d-  or  d,l-1x  solution  was  mixed  with  0.5  mL  of  the 
l,l-72  or  L,L-y2ph  solution  and  trifluoroacetic  acid  (50  equiv).  The 
solutions  were  allowed  to  sit  for  several  days  until  a  steady  state  was 
reached  (typically  7—10  days)  prior  to  LC  analyses. 

Speciation  and  Quantification  of  [2]-Catenanes  by  HPLC 
and  LC-TOF.  The  speciation  and  characterization  of  [2] -catenanes  in 
the  DSA  solution  were  performed  by  HPLC  and  LC-TOF  analyses. 
For  HPLC  analyses,  3.5  //L  of  DSA  solution  was  injected  onto  a  Halo- 
08  column  (4.6  X  150  mm,  2.7  //m)  at  a  flow  rate  of  0.45  mL/min  at 
55  °C.  To  remove  solvent  and  TFA,  a  short  linear  gradient  (35—46% 
MeOH/H20  over  4  min)  was  used.  A  second  gradient  composed  of 
two  linear  gradients  (46—66%  MeOH/H20  over  26  min  and 
consecutive  66%  MeOH/H20  to  100%  MeOH  over  12  min)  was 
used  to  separate  non-catenane  species,  and  then  100%  MeOH  was 
subsequently  used  for  an  additional  12  min  for  [2] -catenane 
speciation.  The  [2] -catenanes  in  the  DSA  solution  were  quantified 
from  their  UV  trace  areas  (289  nm)  and  compared  to  a  calibration 
curve  generated  from  known  concentrations  of  several  isolated 
catenanes  and  their  corresponding  UV  trace  areas  (289  nm).  The 
[2] -catenanes  in  the  DSA  solution  were  identified  by  LC-TOF 
analyses  in  positive-ion  mode  with  an  electrospray  ionization  source 
(fragmentor  =  375  V,  Vcap  =  3500  V,  mass  range  =  100—3200  m/z). 
A  1.0  //L  DSA  solution  was  injected  onto  a  Halo-C18  column  (2.1  X 
50  mm,  2.7  //m)  at  a  flow  rate  of  0.50  mL/min  at  50  °C.  To  remove 
non-catenane  species,  solvent,  and  TFA,  a  short  gradient  (60—70% 
MeOH/H20  containing  0.1%  formic  acid)  was  used.  A  second 
gradient  (70—90%  MeOH/H20  containing  0.1%  formic  acid  over  7 
min)  and  a  subsequent  90%  MeOH/H20  (0.1%  formic  acid)  mobile 
phase  were  utilized  for  an  additional  4  min  for  catenane  speciation.  See 
Supporting  Information-2  for  HPLC  traces  of  DSA  solutions  and  LC- 
TOF  analyses  of  the  generated  catenanes. 
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■  RESULTS 

1.  [2]-Catenane  Generation.  Under  dynamic  self- 
assembly  (DSA)  conditions,  mixtures  of  two  dipeptide 
monomers  that  are  epimeric  at  proline  (D-la  and  l,l-h2  or 
H2ph)  self- assemble  into  octameric  [2]-catenanes  (Scheme  l)11 


Scheme  1.  [2]-Catenane  Self-Assembly  from  Two  Dipeptide 
Monomers  (D-la  and  l,l-h2  or  H2ph) 


Among  these  non-covalent  interactions,  the  most  intriguing 
were  the  CH —n  interactions  which  grip  the  naphthyl  ring  of 
naphthylglycine  between  a  proline  and  an  Aib  residue  (Figure 
lb)  (Aib  =  1-aminoisobutyric  acid,  or  a-methyl-alanine).  The 
rigidity  of  the  peptide  loops  and  the  core  structure  were  probed 
through  H/D  exchange  experiments  in  protic  D-solvents 
(CD3OD,  D20/pyridine).  The  exchangeable  amide  and 
hydrazone  NH  groups  revealed  a  variety  of  chemical  environ¬ 
ments,  including  those  in  the  outer  loops  which  exchange 
readily,  and  those  flanking  the  core  naphthyl,  which  were  much 
slower.  Two  key  NH’s  forming  inter-ring  hydrogen  bonds  in 
the  core  (ft-2  and  ft-4  turns  of  the  catenane  H3a;  see  turn 
designations  in  Figure  lb)  resisted  exchange  even  after  multiple 
weeks  in  an  aggressive  H/D  exchange  solvent  (D20/pyridine). 
In  other  examined  catenanes,  this  H/D  exchange  eventually 
occurred  but  much  more  slowly  than  in  the  outer  loops.  As  fast 
exchange  was  taken  to  reflect  structural  flexibility  and 
breathability,  these  observations  suggest  that  the  core  is 
rigidified/ stabilized  by  the  CH— n  interactions,  which  sandwich 
the  naphthyl  rings  and  pairs  of  inter-ring  hydrogen  bonds. 

X-ray  structure  analysis3  additionally  revealed  that  the  core  of 
each  catenane  is  composed  of  a  type  IT  ft-2  turn  and  a  type  VIII 
ft-4  turn.  In  contrast,  the  peripheral  turns  (ft-l  and  ft-3)  can 
adopt  either  type  I ',  type  I,  or  type  IF  /?-turns.  The  invariability 
of  the  core  suggests  that  these  /?-turn  types  may  also  play  a  role 
in  facilitating  the  above  core  interactions. 

Given  the  notion  that  the  CH —n  interaction  was  the  key 
differentiating  recognition  element,  we  first  sought  to  gain 
insight  by  perturbing  the  ft-2  CH  donor  (Scheme  2b)  and 


in  a  diastereoselective  and  constitutionally  selective  fashion.  X- 
ray  structure  analysis3  showed  that  the  structure  contains  a 
dense  core  from  which  extend  loops  terminating  in  /Fturns. 

As  shown  in  Figure  la,  the  structure  is  flower-like  and 
assembled  from  two  interlocked  saddle-shaped  loops  where 


CH-tt 


Aib  (p-2) 


L-Pro 

(P-4) 


(a)  Petal-like 
D-Pro-X 


Paired  D-Pro-X  (b) 


Aib  (p-2) 


Figure  1.  (a)  X-ray  structure  of  H3a  displaying  two  identical 
interlocked  tetramers.  (b)  View  from  the  bottom-side  of  (a)  showing 
CH —n  interactions  between  the  naphthyl  (space-filled  shape)  and  l- 
Pro  moiety  and  between  the  naphthyl  and  a  methyl  group  of  the  Aib 
residue.  All  hydrogen  atoms  except  those  participating  in  the  CH— n— 
CH  sandwiches  are  omitted  for  clarity. 


each  macrocycle  is  formed  from  three  units  of  1  and  one  unit  of 
2,  i.e.,  [  1321]2.  X-ray  and  NMR  structural  investigation  of 
catenane  H3a  revealed  that  its  dense  self-complementary 
structure  is  stabilized  by  a  combination  of  six  intra-  and  six 
inter-macrocyclic  hydrogen  bonds,  numerous  face-to-face  and 
edge-to-face  aromatic  interactions,  and  the  previously  men¬ 
tioned  CH— n  interactions.  These  interactions  were  commonly 
found  in  all  examined  catenanes.3 


Scheme  2.  Illustration  of  the  CH— Jt— CH  Sandwich  in  H3a 
Showing  CH— 71  Interactions  (a)  between  the  Naphthyl  and 
L-Pro  Moiety  (ft- 4  CH —it)  and  (b)  between  the  Naphthyl 
and  a  Methyl  of  the  Aib  Residue  (ft- 2  CH— 7t) 


determining  how  the  [2] -catenane  responded.  To  this  end,  a 
series  of  methyl  deletion  experiments  were  designed  to 
manipulate  the  Aib  residue  of  ft-2.  Since  this  group  originates 
from  the  D-Pro-Aib  unit,  this  experiment  was  accomplished  by 
employing  the  D-Pro-L-Ala,  D-Pro-D-Ala,  or  D-Pro-Gly  mono¬ 
mers  for  the  DSA  reaction  with  monomer  l,l-h2  (Table  l).  As 
described  in  Scheme  2b,  this  series  of  monomers  remove  the 
non-interacting  methyl  group  (Me2)  in  the  D-Pro-L-Ala  case, 
the  interacting  methyl  group  (Mel)  in  the  D-Pro-D-Ala  case, 
and  both  groups  in  D-Pro-Gly  scenario.  Not  surprisingly,  the  d- 
Pro-L-Ala  situation  (entry  2)  gave  only  a  slightly  lowered 
concentration  of  [2] -catenanes  relative  to  Aib  (50  vs  70%),  as 
this  methyl  deletion  combination  should  leave  the  CH— n 
interaction  largely  untouched.  The  D-Pro-D-Ala  scenario  (entry 
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Table  1.  [2]-Catenanes  Generation  from  D-,  d,l-,  or  d,d-1x  (x 
=  a,  b,  v)  and  l,l-h2 


D.L-lb,  D,D-1b 


[2]-catenanes 


h3x  h4x 


Entry  R  Group 


[2]-Catenane  yields  (%)a 
H3x  H4x  Others5  Total 


1 

a 

38  (63) 

29  (3) 

5  (0) 

72  (66) 

2 

L-b 

40  (56) 

15  (5) 

0  (0) 

55  (61) 

3 

D-bC 

T 

0(0) 

0  (0) 

0  (0) 

0  (0) 

4 

V 

H 

31  (50) 

7  (4) 

1  (0) 

39  (54) 

"Yields  obtained  from  the  UV  trace  of  the  corresponding  catenane  in 
each  DSA  solution  at  day  7—9  (steady  state)  using  a  calibration  curve. 
The  calibration  curve  was  obtained  from  the  HPLC-UV  area  (289  nm) 
of  several  known  concentrations  of  pure  catenanes;  details  in  SI1 
Figure  1.  The  values  in  parentheses  were  obtained  from  3:1  (d:l) 
biased  DSA  solutions.  ^Other  constitutional  isomers  of  3x  and  4x. 
cOnly  trace  amounts  of  [2] -catenanes  were  generated  (<3C  1%). 


3);  however,  was  unexpected,  as  it  led  to  a  large  decrease 
(>  1000-fold)  in  catenanes.  While  on  the  surface  this  suggested 
that  the  CH3— n  interactions  (there  are  two  C2-related 
positions)  were  stronger  than  expected  from  model  studies 
and  calculations,4^12  this  interpretation  was  immediately 
questionable,  as  the  D-Pro-Gly  case  (entry  4),  which  also 
lacks  the  interacting  Mel  group,  forms  a  significant  quantity  of 
[2] -catenanes  (33%).  These  data  require  an  explanation  that 
extends  beyond  the  simple  CH— n  interaction  mode. 

A  likely  source  for  this  trend  became  apparent  when  the 
inherent  /Tturn  preference  of  the  core  units  was  considered,  as 
the  D-Pro-Aib,  D-Pro-D-Ala,  D-Pro-L-Ala,  and  D-Pro-Gly  units 
occupy  the  invariant  (3-2  location  in  the  core.  Revealing  was  the 
fact  that  D-Pro-L-Ala,  D-Pro-Gly,  and  D-Pro-Aib  each  prefer  type 
IT  /Tturns,  while  D-Pro-D-Ala  prefers  a  type  T  /?-turn.13  That  is, 
the  three  monomers  preferring  a  type  IT  turn  form  significant 
[2] -catenane,  while  the  single  case  which  does  not  is  highly 
destabilized.  The  lowered  quantity  of  [2] -catenanes  for  D-Pro- 
Gly  relative  to  entries  1  and  2  indicates  that,  while  the  CH —n 
interaction  is  apparently  stabilizing,  this  effect  is  subordinate  to 
the  ability  to  adopt  a  (3-2  type  IT  turn. 

While  the  stability  trends  for  entries  1,  2,  and  4  suggest 
stability  perturbations  through  CH— n  effects,  these  data 
stimulated  a  broader  investigation  examining  (3- turn  prefer¬ 
ences,  predisposition  to  key  hydrogen  bonds,  and  CH —n 
interactions.  This  study  was  broken  down  into  two  main 
components,  each  focusing  on  a  different  face  of  the  aryl  ring  of 
the  CH— n—  CH  sandwich  {(3-2  and  (3 -4).  Since  the  five  X-ray 
structures  and  additional  NMR  spectra  suggest  that  the  core 


only  accommodates  a  type  IT  (3-2  and  a  type  VIII  (3-4  turn,  it 
become  necessary  to  determine  whether  the  adaptability  of  the 
dipeptides  to  these  turn  types  was  a  prerequisite  for  catenane 
formation.  We  first  address  this  issue  for  the  (3-2  turn  in  detail 
using  the  D-Pro-D-Ala  monomer,  and  then  we  present  the  result 
of  additional  perturbations  to  the  (3-2  face  of  the  aryl  ring, 
simultaneously  considering  the  natural  /Tturn  preferences  of 
the  monomer  variations  (Scheme  3).  This  modulation  sought 


Scheme  3.  [2] -Catenane  Self-Assembly  from  Pairs  of 
Hydrazine/Acetal  Monomers 


D-1x  (x=  a,  r-v)  L,L-y2  (R’  =  Naph)  or 
D,L-1x  (x=  b-q)  L  L_y2Ph  (R*  =  Ph) 

(y  =  H,  F,  OH) 
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to  vary  the  structure  of  the  (3-2  CH  donor  to  determine  how 
catenane  stability  was  affected,  vis-a-vis  its  equilibrium 
concentration.  This  was  achieved  by  substituting  the  D-Pro- 
Aib  unit  for  a  variety  of  D-Pro-L  and  D-Pro-cyclic  dipeptide 
structures.  Since  D-Pro-L-X  and  D-Pro-cyclo-X  dipeptides  can 
adopt  type  IT  turns,14  these  variants  were  thought  to  principally 
vary  the  (3-2  CH  donor  of  the  CH— n—  CH  sandwich.  The  (3-4 
CH  donor  component  of  the  CH— n—  CH  sandwich  and  the 
turn  propensities  were  also  examined  through  proline  variants 
and  the  stereoisomeric  dipeptides  of  L-Pro-L-arylGly  (H2  and 
H2ph),  respectively. 
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The  effect  of  the  above-described  mutations  on  catenane 
formation/ stability  was  assessed  by  determining  the  equilibrium 
speciation  of  the  DSA  solutions.  Typical  compositions  included 
cyclic  dimers  through  hexamers  (pentamers  and  hexamers  were 
present  in  low  abundance),  with  each  n-mer  being  composed  of 
multiple  constitutional  isomers  of  the  two  monomers.  HPLC 
was  used  to  monitor  the  approach  to  the  steady  state,  which 
varied  for  different  L-monomers.15  In  most  cases  octameric  [2]- 
catenanes  are  visible  within  5  min  and  a  significant  amount  of 
catenane  accumulates  within  a  day  (SI1  Figure  2).  Notably,  the 
tetrameric  macrocycle  corresponding  to  one  half  of  3x  (i.e., 
[lA])  is  barely  detectable,  and  one  of  the  most  abundant 
cyclic  species  at  early  time,  a  trimer  subunit  of  3x  composed  of 
two  units  of  1  and  one  unit  of  2  ([l22i]),  substantially 
decreases  as  the  cantenane  concentrations  increase. 

To  compare  [2] -catenane  yields  across  the  series,  [2]- 
catenane  concentrations  were  determined  from  the  HPLC— UV 
trace  (289  nm)  and  calibration  curves  established  for  several 
purified  [2] -catenanes;  compound-to-compound  differences  in 
extinction  coefficient  were  negligible  (SI1  Figure  l).  In  several 
instances  a  precipitate  was  noted  at  the  3—5  day  time  point  of 
the  DSA  reaction.  Even  when  this  occurred,  the  catenane 
concentration  in  the  supernatant  either  continuously  increased 
to  a  steady  state,  which  then  began  to  slowly  drop  after  15  days, 
or  reached  a  steady  state  before  the  onset  of  the  precipitation. 
These  non-equilibrium  situations  are  noted. 

By  quantifying  the  amount  of  catenane  in  a  DSA  reaction, 
one  measures  how  its  free  energy  compares  to  that  of  the  other 
components  of  the  mixture.  In  addition  to  affecting  the 
catenane's  energy,  varying  the  DSA  components  could  also 
perturb  the  relative  energy  of  the  other  components  of  the 
equilibrium  mixture  (dimers,  trimers,  etc.),  and  thus  hinder  the 
comparability  of  one  system  to  another.  To  assess  the  viability 
of  cross-system  comparisons,  we  carried  out  full  speciation 
analyses  on  several  DSA  reactions  (D-la/L,L-H2,  D,L-lb/L,L-H2, 
d,d- lb/ L,L-H2,  d-1v/l,l-h2,  D-la/L,L-H2ph,  D,D-lb/L,L-H2ph,  and 
D-lv/L,L-H2ph  in  Scheme  3)  and  looked  for  any  unusually  stable 
(or  unstable)  cyclic  oligomers  that  could  artificially  lower  (or 
raise)  the  amount  of  catenanes  (see  representative  cases  in  SI1 
Figure  3—5).  These  experiments  demonstrate  that  the 
dimers: trimers itetramers  ratio  does  not  change  significantly 
from  system  to  system,  and  suggest  that  the  structural  changes 
being  explored  herein  do  not  significantly  affect  the  relative 
energy  of  the  non-catenated,  less  structured,  cyclic  oligomers 
present  in  the  DSA  solution.  With  the  caveat  that  this  analysis 
does  not  rigorously  rule  out  subtle  effects,  we  report  both 
absolute  and  relative  (to  the  other  DSA  components) 
concentrations  of  the  [2] -catenanes  and  interpret  these  data 
to  reflect  [2] -catenane  stability. 

2.  fi-2  Turn  and  [2]-Catenane  Generation.  2.1.  Turn 
Type.  X-ray  structure  analysis  revealed  that  all  examined 
catenanes  adopt  a  type  IF  fi-2  turn  in  the  core,  obviously 
suggesting  that  this  turn  is  most  suited  to  a  stable  [2] -catenane. 
The  combination  of  a  type  IF  fi-2  and  a  type  VIII  fi-4  creates  a 
core  structure  wherein  two  key  interactions  can  take  place:  (l) 
CH— n  interactions  between  the  X  residue  of  the  D-Pro-X  unit 
and  the  aryl  residue  of  the  L-Pro-L-arylGly  unit  and  (2)  two 
inter-ring  H-bonds  between  two  /Fturn  backbones  (see  red- 
boxed  interactions  in  Figure  2).  As  mentioned  previously,  the 
preferred  /Fturn  of  the  D-Pro-D-Ala  dipeptide  is  type  F. 
Superimposing  this  mismatched  turn  geometry  onto  the 
structure  of  H3a  reveals  several  destabilizing  interactions.  In 
the  type  F  turn  the  amide  is  rotated  nearly  180°, 16  which 
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Figure  2.  X-ray  structure  of  H3a  displaying  the  inter-ring  hydrogen 
bonds  (black  dot-lines).  The  red  box  indicates  a  CH— n  interaction 
(black  up-down  arrow)  and  inter-ring  hydrogen  bonds  between  the  p- 
2  and  p-4  backbones.  The  blue  circle  indicates  the  loss  of  the  inter-ring 
hydrogen  bond  and  gain  of  the  repulsive  interaction  caused  by  a 
proximate  carbonyl  group  in  the  p-2  turn  when  a  type  I'  turn  instead  of 
a  type  IF  turn  is  accommodated  in  the  P-2  turn.  All  hydrogen  atoms 
except  those  forming  inter-ring  hydrogen  bonds  and  the  CH— n 
interaction  (red  box)  are  omitted  for  clarity. 


removes  the  inter-ring  hydrogen  bond  while  simultaneously 
creating  a  repulsive  carbonyl— carbonyl  interaction  (see  the 
shadow  interactions  in  Figure  2).  This  repulsive  term, 
combined  with  the  loss  of  two  CH3— n  interactions  (doubled 
due  to  C2  symmetry),  is  the  likely  cause  for  the  dramatic  loss  of 
[2]-catenane  (>  1000-fold  drop).17 

Interestingly,  a  similar  degree  of  destabilization  can  be  found 
in  the  difference  in  binding  affinity  of  vancomycin  to  the  D-Ala- 
D-Lac  mutant  relative  to  D-Ala-D-Ala  (1000-fold  drop).18  The 
introduction  of  an  ester  oxygen  in  D-Ala-D-Lac  causes  both  the 
loss  of  a  hydrogen  bond  and  the  introduction  of  a  destabilizing 
electrostatic  repulsion  between  a  carbonyl  and  the  ester  oxygen 
in  D-Ala-D-Lac.  This  parallel  suggests  that  the  loss  of  the 
hydrogen  bond  and  the  additional  repulsive  interaction 
expected  for  D-Pro-D-Ala  may  be  the  primary  reasons  for 
destabilization,  with  loss  of  the  CH3— n  interactions  playing  a 
lesser  role.  This  is  supported  by  the  ability  of  the  catenane  to 
accommodate  D-Pro-Gly  in  the  core. 

Although  the  above  structural  rationale  for  loss  of  catenane 
using  D-Pro-D-Ala  is  strong,  two  additional  explanations  were 
considered:  (l)  an  exceptionally  stable  cyclic  oligomer  was 
being  formed,  with  the  concomitant  effect  of  draining  the 
concentration  of  [2] -catenane,  and  (2)  the  loss  of  stability  was 
not  due  to  a  specific  set  of  interactions  at  the  p-2  position,  but 
the  monomer  simply  could  not  occupy  one  of  the  other  two 
positions  (/?- 1  and  /?-3)  in  the  structure  (despite  their 
heightened  structurally  flexibility). 

To  address  scenario  1,  a  full  speciation  analysis  was  carried 
out  on  the  non-catenated  products  for  the  two  experimental 
permutations:  D-Pro-Aib/L-Pro-L-NaphGly  (D-la/L,L-H2), 
which  gives  [2] -catenane,  and  D-Pro-D-Ala/L-Pro-L-NaphGly 
(D,D-lb/L,L-H2),  which  does  not.  The  results  clearly  demon¬ 
strate  that  no  unusually  stable  or  unstable  n-mers  exist  in  either 
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experiment  (see  SI1  Figures  3  and  4).  When  catenane  is 
formed^  this  naturally  lessens  the  total  amount  of  n-mers 
remaining  at  equilibrium;  but  within  this  group  of  cyclicS;  the 
distribution  of  2-mers:3-mers:4-mers  is  unchanged.19  Since  no 
unusual  stabilities  were  evident  in  the  macro  cycles;  these  data 
suggest  that  the  lack  of  [2] -catenane  formation  with  D-Ala  lies 
in  the  [2] -catenane  itself. 

Given  this  evidence;  the  problem  logically  reduces  to  the  d- 
Ala  monomer  not  being  accommodated  into  the  fi-l,  fi-2,  and 
/?-3  positions  or  combinations  thereof.  The  turn  type 
destabilization  rationale  presented  above  would  suggest  that 
the  problem  is  localized  at  the  fi-2  site;  but  since  the  data  do 
not  rule  out  the  possibility  that  the  problem  lies  with  its 
positioning  at  fi-l,  p-3,  or  both;  three  monomer  experiments 
were  designed.  Outlined  in  Scheme  4  is  the  logical  consequence 
of  mixing  monomer  2  with  two  different  1  monomers  on  the 
production  of  3;20  one  of  which  is  the  previously  unincorpo¬ 
rated  D-Pro-D-Ala  (D;D-lb);  while  the  second  is  the  successfully 
incorporated  D-Pro-Aib  (D-la). 

In  the  event  that  D-Pro-D-Ala  is  not  incorporated  in  any  of 
the  three  positions  (i.e.;  NO;  NO;  case  1  in  Scheme  4),  then 
the  three  monomer  experiment  would  form  [2] -catenane 
incorporating  the  D-Pro-Aib  monomer  exclusively;  i.e.;  [(d- 
la)6(L,L-H2)2],  with  no  catenanes  containing  D;D-lb  being 
detected.  If;  on  the  other  hand;  the  D-Pro-D-Ala  monomer  is 
incorporated  into  one  or  a  combination  of  the  fi-sites,  then 
different  constitutional  isomers  are  possible;  each  of  which  can 
be  discriminated  in  an  LC-TOF  analysis.  While  the  fi-l  and  /?-3 
sites  are  chemically  inequivalent;  the  analysis  in  Scheme  4  treats 
them  as  either  equally  probable  or  improbable;  though  their 
contributions  to  the  constitutional  isomerism  of  mixed  [2]- 
catenanes  is  implied.  As  will  be  seen  below;  the  logical 
differentiation  between  the  fi-l  and  /?-3  sites  was  not  necessary 
to  explain  the  data. 

If  the  D-Pro-D-Ala  monomer  could  only  occupy  the  /?-2  sites 
(i.e.;  YES;  NO;  case  3  in  Scheme  4),  only  two  mixed  catenanes 
of  3  would  be  formed.  If  the  opposite  were  true  and  D-Pro-D- 
Ala  was  only  excluded  from  the  /?-2  turn  (i.e.;  NO;  YES;  case  2 
in  Scheme  4),  then  a  significant  number  of  mixed  [2] -catenanes 
would  result.  For  [2] -catenane  3;  this  latter  scenario  would 
require  that  each  isomer  contain  a  minimum  of  two  D-Pro-Aib 
monomers  to  occupy  the  /?-2/2'  positions;  with  the  balance  of 
positions  being  occupied  by  logical  combinations  of  D-Pro-Aib 
and  D-Pro-D-Ala.  Conversely;  a  maximum  of  four  D-Pro-D-Ala 
units  can  be  accommodated  (case  2  in  Scheme  4). 21 

When  D-Pro-D-Ala;  D-Pro-Aib;  and  l;l-h2  were  combined  in  a 
1:1:2  ratio;  a  complex  mixture  of  catenanes  was  obtained; 
though  the  identity  of  each  compound  could  be  deconvoluted 
by  LC-TOF  (Figure  3).  Clear  from  this  analysis  was  the  fact 
that  each  catenane  contained  at  least  2  equiv  of  L;L-H2 
(expected)  and  no  less  than  2  equiv  of  D-Pro-Aib.  The  diversity 
in  composition  in  Figure  3a;  therefore;  results  from  the  D-Ala 
and  Aib  monomers  being  statistically  distributed  into  the  /?-l, 
/?-l';  /?-3;  and  /?-3'  sites;  with  the  fi-2  and  jd-2'  sites  being  solely 
occupied  by  the  Aib  monomers. 

When  the  ratio  of  D-Pro  and  the  L-Pro  monomers  is  changed 
to  3:1;  that  iS;  D-Pro-D-Ala:D-Pro-Aib:L;L-H2  =  3:3:2;  the 
catenane  compositional  diversity  is  unchanged  from  Figure 
3a;  though  the  absolute  intensity  of  each  species  increases  due 
to  the  D-Pro-bias  (SI1  Figure  6a).  On  the  other  hand;  varying 
the  ratio  of  the  two  D-Pro  monomers  (keeping  the  D-Pro  :L-Pro 
ratio  at  1:1)  shows  that  D-Pro-D-Ala-rich  cases  generate  less 
catenanes  and  fewer  isomers;  while  D-Pro-Aib-rich  cases 


Scheme  4.  Logical  [2] -Catenane  Combinations  from  Mixing 
Two  1  Monomers  (D-la;  D;D-lb)  with  the  L;L-H2  Monomer 

g;1 

p-3  p-3 


P-2 


generate  comparatively  more  catenane  but  fewer  isomers  (SI1 
Figure  6b). 

We  interpret  these  data  to  reflect  an  absolute  requirement  for 
two  L;L-H2  monomers  in  the  jd-4/4'  positions  and  two  D-Pro- 
Aib  monomers  in  the  (5-2/ 2 '  positions.22  The  D-Pro-D-Ala 
monomer  can  therefore  be  easily  accommodated  into  the  /?-l 
and  /?-3  sites;  but  not  the  fi-2  site.  We  conclude  that  the  loss  of 
catenane  stability  is  localized  to  the  jd-2/2'  sites;  and  that  the 
principal  requirement  for  catenane  stability  is  adopting  the 
necessary  type  IF  /Lturn.  Once  the  turn  is  in  place;  the  non- 
covalent  interactions  in  Figure  2  function  to  perturb  the 
comparative  stabilities. 
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[(D-1aMD,D-1b)0(L,L-"2)2+ H]+  •  [(D-1a)s(D,D-1b)0(L,L-H2h+HYO 
[(D-1a)5(D,D-1b)i(L,L-H2)2+H]*  •  [(D-la)4(D,D-t6)1(/.,/.-H2)3+H]+0 
[(D-f  aMD,D-1b)2(L,L-"2)2+Hr  •  l(D-1a)j(D,D-1b)2(L,L-H2)3+HY$ 
l(D-1aUD,D-1b)3(L,L-H2)2+HY  •  [(D-fa)4(D,D-fft)o(t,L-H2)4+H]+A 
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Figure  3.  LC-TOF  UV  trace  (289  nm)  of  [2]-catenanes  at  day  7 
generated  from  (a)  D-la:D,D-lb:L,L-H2  =  1:1:2  mixture,  (b)  D-la:L,L-H2 
=  1:1  mixture,  and  (c)  D,D-lb:L,L-H2  =  1:1  mixture  (5  mM  in  25% 
MeCN/CHCl3,  50  equiv  of  TFA).  Each  peak  was  identified  by  LC- 
TOF  analysis. 


2.2.  Aib  Modifications  in  D-Pro-Aib.  The  X-ray  structure 
analysis  and  ROESY  NMR  spectrum  of  H3a3,11  established  a 
CH —n  interaction  between  the  nucleated  naphthyl  group  and 
one  of  the  Aib  methyl  groups  (/?-2).  Since  only  one  CH3  group 
was  upheld  shifted,  it  implied  that  [2]-catenane  stability  might 
not  be  restricted  to  Aib,  as  other  arrangements  should  be 
capable  of  creating  a  productive  CH— n  interaction,  provided 
that  the  preferred  /?-turn  is  type  IT  (vide  supra).  The  catenanes 
generated  by  the  D-Pro-L-Ala  monomer  (entry  2  in  Table  l) 
support  this  hypothesis.  Candidate  structures  for  varying  the  (3- 
2  CH  donor  include  D-Pro-X  and  D-Pro-L-X  dipeptides,  as  these 
predominantly  form  type  IT  /Tturns. 130,23 

As  shown  in  Table  2  and  SI1  Table  1,  all  tested  d-  and  d,l- 
monomers  generated  the  corresponding  [2] -catenanes,  with 
both  aliphatic  and  aromatic  groups  being  well  behaved.  In  the 
D,L-monomers  having  an  aliphatic  alanine  substituent,  a  clear 
steric  trend  was  noted,  with  larger  groups  destabilizing  the 
structure.  Considering  the  dense  nature  of  this  region  (Figure 
4),  the  stability  response  to  steric  changes  (n-Bu  vs  z'-Pr  vs  tert- 
Bu)  was  surprisingly  muted,  as  even  a  tert- butyl  for  methyl 
substitution  provided  a  measurable  quantity  of  [2]-catenane 
(Table  2,  entry  7).  Polar  substituents  (OH  or  F),  however, 
were  disproportionally  detrimental  to  stability.24  This  observa¬ 
tion  is  readily  accommodated  by  a  model  which  invokes  a 
repulsive  polar— n  interaction  in  the  constrained  pocket. 

D,L-Monomers  having  an  arylGly  residue  also  generated  the 
corresponding  [2] -catenanes  (Table  2,  entries  8—10;  SI1  Table 
1,  entries  4—6),  implying  that  a  stabilizing  n—n  interaction 
between  the  naphthyl  group  and  the  arylGly  residue  can  be 
accommodated.  One  example  of  this  family  of  [2] -catenanes 
was  crystallographically  characterized3  and  shown  to  adopt  a 
nonsymmetric  solid-state  structure  featuring  a  CH— n—  n 
sandwich  in  the  core,  wherein  the  (3-2  face  creates  n—n 
interactions  that  are  either  of  the  slipped  n—n  or  edge-to-face 


Table  2.  Yields  of  [2] -Catenanes  in  the  DSA  Solutions 
Formed  from  Various  D-Monomers  and  l,l-h2 


[2] -Catenane  yields  (%)a 


Entry 

R  Group 

V 

H4x 

Oth¬ 

ers15 

Total 

1 

cc 

V 

39  (-) 

25  (-) 

3  (*) 

67  (-) 

2 

d 

35  (60) 

28  (7) 

3(0) 

66  (67) 

3 

g 

^  ^-NH 

f3c 

63  (24) 

0  (0) 

0  (0) 

63  (24) 

4 

hc 

/^CF3 

10  (-) 

5  (■) 

o(-) 

15  (■) 

5 

ic 

39  (-) 

n(-) 

o(-) 

50  (-) 

6 

) 

-2- OH 

29  (15) 

0  (0) 

0(0) 

29  (15) 

7 

k 

8  (14) 

3(3) 

0(0) 

11  (17) 

8 

1 

9 

OMe 

42  (54) 

11(2) 

3(3) 

56  (59) 

9 

m 

9 

cf3 

42  (55) 

14(4) 

2(4) 

58  (63) 

10 

oc 

9 

24d  (-) 

6d  (-) 

ld(-) 

3id(-) 

11 

r 

16  (23) 

20  (9) 

2(0) 

38  (32) 

12 

s 

■£> 

22  (56) 

25  (2) 

22  (4) 

69  (62) 

13 

t 

O 

26  (50) 

31  (10) 

18  (6) 

75  (66) 

14 

u 

O 

28  (54) 

30  (10) 

17(4) 

75  (68) 

Yields  obtained  from  the  UV  trace 

(289  nrr 

1)  of  the  corresponding 

catenane  in  each  DSA  solution  at  day  7—9  using  a  calibration  curve. 
The  calibration  curve  was  obtained  from  the  HPLC-UV  area  (289  nm) 
of  several  known  concentrations  of  pure  catenanes;  Details  in  SI1 
Figure  1.  The  values  in  parentheses  were  obtained  from  3:1  (d:l) 
biased  DSA  solutions.  ^Other  constitutional  isomers  of  3x  and  4x. 
cBiased  DSAs  (3:l)  led  to  precipitates  and  yields  are  not  reported. 
^Precipitation  occurred  within  3—5  days.  The  yields  of  the  generated 
catenanes  were  obtained  at  day  7—8  after  filtering  the  precipitates. 


variety.  In  cases  where  electronically  perturbed  phenyl  glycines 
were  utilized,  the  electron-releasing  para- methoxy  substituent 
gave  more  catenane  than  did  the  p-CF3  substituent  (Table  2, 
entries  9  vs  10),  in  apparent  contradiction  to  the  Hunter— 
Sanders  model 25 

When  cyclic  a,a-dialkylated  analogues  of  Aib  were  used, 
outcomes  that  depended  on  ring  size  were  obtained,  with  the 
optimal  size  being  a  C5  or  C6  ring  (Table  2,  entries  11  —  14).  As 
predicted  by  the  respective  catenane  stabilities,  the  /Tturns  of  d- 
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L-Pro 


Figure  4.  X-ray  structure  of  H3a  displaying  the  compact  structure 
between  the  naphthyl  and  the  Aib.  Space-filled  shapes  are  used  for 
these  groups. 


Pro- 1 -amino- 1-cycloalkanecarboxylic  acids  (Acsc  and  Ac6c)  are 
similar  to  those  of  D-Pro-Aib.14d'26  Although  the  reasons  are  not 
obvious,  these  cyclic  monomers  skew  the  catenane  composi¬ 
tions  toward  H4x  and  the  other  minor  catenanes.  Since  the 
cyclic  a,a-dialkylated  analogues  of  Aib  populate  a  more 
constrained  conformational  space  at  both  the  mutated  and 
adjacent  positions/46'27  these  modifications  apparently  com¬ 
pensate  for  the  structural  changes  that  result  from  replacing  a 
type  IP  for  a  type  I  turn  at  the  /?-3  or  p-1  positions. 

As  mentioned  in  section  1,  the  D-Pro-Gly  monomer,  which 
lacks  the  methyl  group  that  participates  in  the  CH —n 
interaction  but  favors  a  type  IP  /Fturn,  creates  a  significant 
quantity  of  [2] -catenane  under  the  DSA  conditions.28  The 
structural  flexibility  noted  by  the  steric  trend  in  this  position 
suggests  that  the  structure  could  adjust  to  fill  the  void  and  gain 
stabilization  through  a  backbone  CH —n  interaction.  Alter¬ 
natively,  the  competing  cyclic  n- mers  would  somehow  have 
been  disadvantaged,  though  a  speciation  analysis  suggests  that 
this  is  not  the  case  (SI1  Figure  5).  A  three-component  DSA 
reaction  using  a  1:1  mixture  of  D-Pro-Gly/D-Pro-Aib  as  the  1 
components  with  l,l-h2  returned  results  indicating  that,  like  d- 
Pro-Aib,  D-Pro-Gly  has  an  equal  propensity  to  occupy  the  /?- 1, 
p-2,  and  /?-3  positions  (in  contrast  to  D-Pro-D-Ala;  see  SI1 
Figures  8—10). 29  In  all  respects  the  D-Pro-Gly  monomer  is 
statistically  equivalent  to  the  parent  monomer  (o-Pro-Aib)  and 
is  thermodynamically  viable  in  all  positions.  These  data 
therefore  imply  that  a  CH —n  interaction  on  the  fi-2  face  of 
the  arene  is  not  an  absolute  requirement  for  competitive 
stability  or  the  structure  can  significantly  reorganize  to  position 
the  glycine  CH2  group  into  the  CH3— n  position. 

In  an  attempt  to  separate  the  type  IF  p-2  turn  requirement 
from  the  subsequent  inter-ring  hydrogen  bond,  the  D-Pro-L-Pro 
monomer  was  tested.  This  dipeptide  favors  a  type  IF  /?-turn30 
but  lacks  the  amide  NH  to  complete  the  inter-ring  hydrogen 
bond  (Figure  5).  The  ~10%  yield  of  catenanes  (see  SI1  Table 
l)  indicates  that  a  structure  lacking  two  (/?-2(NH)— »/?-4(C= 
O))  of  six  inter-ring  hydrogen  bonds  can  at  least  partially 
compete  with  non-catenanted  macrocycles,  though  at  a  price. 
The  generation  of  catenanes  with  this  disadvantage  implies  that 
the  otherwise  conserved  inter-ring  hydrogen  bond  is  not  an 
absolute  requirement  for  catenane  stability,  as  its  loss  can  be 
partially  offset  by  the  network  of  CH— n,  n—n,  and  inter-ring 
hydrogen-bonding  interactions. 


Figure  5.  Picture  superimposing  an  imaginary  L-Pro  group  instead  of 
the  Aib  (P-2')  in  the  X-ray  structure  of  H3a  (the  black  outlined  shape 
in  the  blue  box).  Since  L-Pro  group  has  the  N  (P-2')  instead  of  the 
amide  NH,  it  induces  a  structure  lacking  an  inter-ring  hydrogen  bond 
(/F2'(NH)— »/?-4(C=0))  between  the  P-2  and  P-4  backbone.  All 
hydrogen  atoms  except  those  forming  inter-ring  hydrogen  bonds  are 
omitted  for  clarity. 


3.  p- 4  Turn  and  [2]-Catenane  Generation.  3.1.  Turn 

Type.  X-ray  structure  analysis  and  H/D  exchange  experiments3 
indicated  that  every  p- 4  turn  in  the  core  is  of  the  type  VIII  form 
and  this  conformation  is  maintained  in  solution.  Previous 
studies  have  shown  that  L-Pro-L-X  dipeptide  units  favor  type  I 
turns,31  which  is  the  form  adopted  by  L-Pro-L-NaphGly  when  it 
is  located  in  the  flexible  P~ 3  loop  of  catenane  F4u  (F4b  in  ref  3). 
In  this  position,  the  naphthyl  group  is  not  involved  in  any 
CH— n  interactions. 

Adopting  a  type  VIII  turn  in  p- 4  thus  requires  the  L-Pro-L- 
arylGly  units  to  undergo  a  turn  change.  A  comparative  analysis 
of  the  standard  turn  dihedral  angles  (type  I,  Oi+v  —64°,  T/+1, 
-27°,  Oi+2,  -90°,  and  %+2,  -7°;  type  VIII,  <DI+1,  -72°,  %+1) 
—33°,  Oi+2,  —123°,  and  'Pi+2,  121°)  indicates  that  the 
conversion  from  type  I  to  type  VIII  requires  an  ~130°  rotation 
of  the  i+2  carbonyl  group  (*Pi+2),  which  links  the  L-arylGly 
carbonyl  group  to  the  1,4-benzamide  of  the  following  turn.  This 
analysis  suggests  that  the  energy  difference  between  type  I  and 
type  VIII  is  sufficiently  small  as  to  be  compensated  by  the 
CH— n  interactions  between  the  Pro  and  arylGly  residues  and 
two  inter-ring  hydrogen  bonds  (/?-4(N)— ►/M/O)  and  p- 
4'(N)— >/?-4(0),  Figure  6). 

Stereoisomeric  L-Pro-L-arylGly  (aryl  =  naphthyl,  H2;  phenyl, 
H2ph)  dipeptides,  i.e.,  L-Pro-D-arylGly,  D-Pro-D-arylGly,  and  d- 
Pro-L-arylGly  dipeptides,  were  investigated  to  probe  the  validity 
of  the  above  suggestion.  In  combination  with  D-Pro-Aib  (D-la), 
each  of  these  cases  generated  only  non-catenated  cyclic 
macrocycles.  Since  the  i+1  and  i+2  configurations  of  the  l,d, 
d,d,  and  d,l  dipeptides  favor  type  II,  F,  and  IF  P~ turns, 
respectively,136'2  '3  6  their  metrical  biases32  would  require 
significant  conformational  change  to  both  residues  (Pro  and 
arylGly)  to  achieve  the  optimal  P~4  structure,  differences  which 
might  not  be  compensated  for  through  weak  secondary  forces. 

3.2.  L-Pro  Modification  in  the  L-Pro-L-arylGly  unit.  As 
depicted  in  Figure  1  and  Scheme  2a,  the  structure  of  H3a 
supports  a  CH— n  interaction  between  a  proline  unit  and  the 
naphthyl  group  (/?-4  to  p-4').  Since  the  acidity  and  geometric 
arrangement  of  these  CH  donors  should  perturb  the  CH— n 
interaction,33  we  investigated  the  effect  of  electronegative 
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Figure  6.  X-ray  structure  of  H3a  displaying  the  inter-ring  hydrogen 
bonds  between  the  j3 -4  and  /3-4'  backbones  (red  dotted  lines).  The  red 
circles  indicate  the  rotation  (the  black  arrow)  of  the  L-NaphGly 
carbonyl  group  (i+ 2  residue)  in  the  [3-4  turn  for  the  turn  conversion 
(type  I  to  type  VIII).  All  hydrogen  atoms  except  those  forming  inter¬ 
ring  hydrogen  bonds  are  omitted  for  clarity. 


substituents  at  HI  and  H2  (Scheme  2a).  Supporting  this 
hypothesis  are  experimental  studies  by  Zondlo  showing  that 
inductive  effects  enhance  proline  CH— n  interactions,  a  and 
computational  studies  by  noting  the  benefit  of  a-halo 
substituents  on  CH— n  binding  strengths. 33b 

To  this  end  we  examined  DSA  reactions  with  mixtures  of  d- 
Pro-Aib  (D-la)  and  variants  of  L-yPro-L-NaphGly  (y  =  OH,  F), 
where  HI  is  either  OH  or  F  (fruns-y-L-proline;  (2S,4R)-4-y-L- 
proline)  or  where  H2  is  OH  (ds-hydroxy-L-proline;  (2S,4S)-4- 
hydroxy-L-proline)  (Scheme  2a),  or  similarly,  using  the  L-yPro- 
L-PhGly  analogues.34  In  all  cases,  DSA  reactions  with  polar  H2 
substituents  (i.e.,  ris-OH  or  ds-F)  provided  no  detectable 
catenane.  However,  in  the  HI  replacement  series  (trans-OH 
and  trans- F),  significant  quantities  of  the  [2]-catenanes  were 
generated  (Figure  7). 35  Complicating  this  analysis  was  the 
onset  of  precipitation  after  3—5  days  for  these  monomers, 
though  the  kinetic  evolution  of  catenane  formation  clearly 
shows  that  [2] -catenane  concentrations  reach  a  steady  state 
before  the  onset  of  precipitation  (see  SI1  Figures  13  and  14). 

Since  electronegative  groups  at  HI  (trans- y)  increase  the 
positive  charge  on  the  interacting  hydrogens,  this  substitution 
should  intensify  the  electrostatic  component  of  the  CH— n 
interaction.33^36  An  additional  factor  that  may  play  a  minor  role 
is  the  fact  that  trans- 4-substituted  prolines  favor  the  Cy-exo  ring 
pucker,  which  is  the  observed  pucker  in  all  structurally 
characterized  catenanes.3,37  The  growth  of  [2] -catenane 
concentration  on  introducing  electronegative  elements  at  HI 
implies  a  slightly  stronger  CH— n  interaction.  The  perturbation 
is  especially  noticeable  when  the  nucleating  aromatic  is  phenyl. 
The  lack  of  catenane  with  electronegative  H2  substituents  (cis- 
y)  likely  reflects  a  destabilizing  electrostatic  repulsion  between 
the  naphthyl  n  system  and  the  proximal  OH  or  F  atom.  The  Cy- 
endo  arrangement  of  ds-4-substituted  prolines37  likely  results  in 
an  additional  destabilization  both  by  depopulating  the  more 
favorable  pucker,  and  by  also  positioning  the  electronegative 
group  closer  to  the  aryl  n—  donor.  However,  since  the  proline 
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Figure  7.  Generation  of  [2]-catenanes  from  D-la  and  L,L-y2  or  L,L-y2ph. 
aOther  constitutional  isomers  of  3x  and  4x.  bAlthough  precipitation  of 
noncatenated  products  occur  after  3—5  days,  the  catenane  reaches  a 
steady  state  within  3  days  (see  the  details  in  SI1  Figures  13  and  14). 


ring  pucker  has  been  known  to  bias  the  backbone  torsional 
angles  of  adjacent  residues,38  it  cannot  be  excluded  that  ds-4- 
substituted  prolines  with  the  C y-endo  arrangement  prefer  a  (3 - 
turn  that  is  not  type  I,  thus  causing  a  destabilization  of  the 
catenane  (vide  supra).39 

Unexpectedly,  the  constitutional  selectivity  of  the  catenane  is 
sensitive  to  the  nature  of  the  HI  substituent,  with  trans-OH 
decreasing  the  amount  of  4  and  trans- F  increasing  it.  In  the 
former  case,  H°3a  accounts  for  more  than  95%  of  the  total 
catenanes.  The  higher  propensity  for  forming  the  non-C2 
product  4  in  the  trans- F  case  was  consistently  observed  in 
DSA  reactions  of  D-monomers  and  L-fransFPro-L-NaphGly 
(h,L-trans'F2,  Figure  8).  In  all  paired  DSA  reactions  (Figure  8), 
the  equilibrium  concentration  of  the  non-symmetric  [2]- 
catenane,  [F4x]eq,  was  higher  than  [H4x]eq;  the  total  yield  of 
F  [2]  -catenanes  was  also  higher.  In  particular,  F4x  accounted  for 
~50%  of  all  catenanes  in  DSA  reactions  generated  from 
cycloaliphatic  monomers  (D-lr— d-1u,  Scheme  3).  The  intrinsic 
preference  for  H4x  in  these  monomers  (Table  2,  entries  11  — 
14)  and  F4x  favoritism  in  fluorine-substituted  monomers  is 
additive,  as  judged  by  the  high  [2] -catenane  concentrations  for 
Is,  It,  and  lu  in  Figure  8.  This  effect  was  utilized  to  advantage 
in  isolating  a  pure  sample  of  F4u  for  X-ray  analysis.3 

The  X-ray  structures  in  the  previous  paper3  indicate  that  the 
proline— n  interaction  is  created  from  two  pseudo-axial  CH’s 
and  one  pseudo-equatorial  CH  interacting  with  the  aryl  ring. 
Computational  studies  of  CH— n  interactions  between  cyclic 
hydrocarbons  and  aromatics  indicate  that  CH— n  strengths 
track  with  the  number  of  contacting  CH’s,  although  the  total 
strength  is  not  precisely  additive.40,41  To  probe  the  effect  of  CH 
donor  number,  the  four-membered  analogue  of  L-Pro  was 
examined.  L-Azetidine-2-carboxylic  acid  (Aze)  was  employed 
(l,l-h2Azc  and  L,L-H2phAze)  along  with  d-  or  d,l-1x  (x  =  a,  b,  m, 
u,  v)  to  determine  if  reducing  the  number  of  CH  donors  would 
decrease  the  amount  of  [2] -catenanes.  This  substitution  is  not 


11438 


dx.doi.org/10.1021/ja302347q  \J.  Am.  Chem.  Soc.  2012,  134,  11430-11443 


Journal  of  the  American  Chemical  Society 


Article 


Table  3.  [2] -Catenaries  Generation  from  d-  or  d,l-  (x  =  a,  b, 
m,  u,  v)  and 


[2]-catenanes 

<&<& 
y3x  y4x 


D-tx  (x  =  r-u) 

D,L-1x  (x  =  d.  e.  j,  k) 


L.L-y2  (y  =  H.  F) 

□  h3x  D  h4x  D  others3 
■  F3x  D  F4x  D  others8 


20 


l  i  1 1  i  j  1 1 


1j  Ik  Id  1e  1r  Is  It 


1u 


Figure  8.  Generation  of  [2]-catenanes  from  d-  or  D,L-la  and  l,l-h2 
(dot-filled)  or  L,L-trans'F2  (filled).  aOther  constitutional  isomers  of  3x 
and  4x. 


flawless,  as  Aze’s  dihedral  preferences  and  the  projection  of  the 
CH's  could  be  slightly  different  due  to  its  flattened  ring 
pucker.42  In  addition,  a  different  turn  preferences  from  L-Pro-X 
cannot  be  excluded,  as  the  turn  preferences  of  the  dipeptide  l- 
Aze-X  are  not  known. 

l,l-h2Azc  and  several  monomers  (Table  3)  were  combined 
and  the  total  steady-state  concentrations  determined.  In  general 
the  Aze  monomer  was  tolerated,  implying  that  l,l-h2Azc  can 
adopt  the  type  VIII  turn  that  l,l-h2  does.  In  the  parent  D-Pro- 
Aib  case,  the  decreased  quantity  of  [2]-catenanes  (from  72%  to 
47%  of  the  total)  implies  a  weakened  CH —n  interaction  due  to 
either  an  imperfect  arrangement  or  a  reduction  of  the  number 
of  CH  donors.  For  entries  3  and  5,  which  position  a  4-MeO- 
C6H4-ring  and  a  glycine  on  the  opposite  face  of  the  naphthyl 
ring,  only  traces  of  [2]-catenane  were  observed.  When  the 
naphthyl  group  was  replaced  with  phenyl  (L,L-H2phAze),  all  cases 
gave  <11%  [2]-catenanes  (SI1  Table  3).  Thus,  weakening  the 
n,  reducing  the  number  of  proline  CH  donors,  and  removing 
the  optimal  CH  donors  in  the  (3-2  position  (Scheme  2)  causes 
total  or  near  total  loss  of  [2]-catenane  viability.  For  the 
naphthyl  cases,  it  appears  that  a  minimum  of  three  CH —n 
interactions  are  needed  for  measurable  stability;  a  Pro— n—  Gly 
case  can  be  accommodated,  but  an  Aze— n—  Gly  cannot. 
Increasing  the  CH  donors  from  the  (3-2  position  in  Aze—Ji— 
Aib  restores  measurable  stability,  suggesting  that  the  catenane 
can  compensate  for  the  loss  of  some  CH— n  interactions,  but 
once  a  threshold  of  three  has  been  reached,  the  structure  is  no 
longer  competitive.  When  the  n  is  phenyl  (2ph  series),  the 
threshold  is  higher. 


[2)-catenanes 

4xAm 


D,L-1x  (x  =  b.  m) 

[2] -Catenane  yields  (%)a 

Entry  R  Group  H  H  T  „  ,  .  b 

3^Aze  4^Aze  A  Otai  ^Pro-Aze 


OMe 


2  b 


3  m 


o 


5  V 


H 


45  (51)  2  (2)  47(53)  25  (13) 

29  (31)  o  (o)  29  (31)  26  (30) 

5d  (21)  od(o)  5d  (21)  53  (42) 

55  (67)  4  (1)  62  (69 )c  13  (1) 

2  (5)  o  (o)  2  (5)  37  (49) 

aYields  obtained  from  the  UV  trace  of  the  corresponding  catenane  in 
each  DSA  solution  at  day  7—9  using  a  calibration  curve.  The 
calibration  curve  was  obtained  from  the  HPLC-UV  area  (289  nm)  of 
several  known  concentrations  of  pure  catenanes;  details  in  SI1  Figure 
1.  The  values  in  the  parentheses  were  obtained  from  3:1  (d:l)  biased 
DSA  solutions.  ^The  yield  difference  between  total  catenanes  obtained 
from  DSAs  employing  l,l-h2  and  l,l-h2Azc,  respectively.  cThe  yields  of 
other  constitutional  isomers  of  3x  and  4x  were  3  (l)%.  ^Precipitation 
occurred  within  3—5  days.  The  yields  of  the  generated  catenanes  were 
obtained  at  day  7—8  after  filtering  the  precipitates. 


■  DISCUSSION 

In  summary,  mixtures  of  two  dipeptides  that  are  epimeric  at 
proline  generate  octameric  [2] -catenanes  under  dynamic  self- 
assembly  (DSA)  conditions;  the  reactions  are  diastereoselective 
and  constitutionally  selective.  The  solid-state  and  NMR 
structural  analysis  of  the  [2] -catenanes  discussed  in  the 
previous  paper3  reveal  that  the  catenanes  are  stabilized  by  a 
combination  of  intra-  and  inter-macro  cyclic  hydrogen  bonds, 
multiple  aromatic  interactions,  and  CH— n  interactions.  The 
outcome  of  our  structure/stability  studies  indicates  that  the 
core  structure  is  especially  important,  with  the  pair  of  CH— n— 
CH  sandwiches,  the  inter-macrocycle  hydrogen  bonds  between 
the  (3-2  and  (3-4  turns,  and  the  predisposition  for  type  IT  (for  (3- 
2)  and  type  VIII  (for  (3-4 )  /?-turns  being  important  variables. 
Indeed,  the  /Tturn  conformation  at  the  core  was  found  to  be 
the  critical  feature  predicting  stability,  while  the  others 
individually  contribute  to  catenane  stability;  to  various  degrees 
they  could  be  weakened  or  removed  with  a  concomitant  cost. 

Mutation  studies  in  the  core  found  that,  when  the  proper 
turns  were  accessible,  the  stability  of  the  catenane  relative  to  its 
competing  non-catenated  structures  could  be  reasonably 
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described  by  a  combination  of  non-covalent  interactions. 
Through  methyl  deletion  experiments,  proline  to  F-proline  or 
HO-proline  mutations,  and  proline  to  azetidine  carboxylic  acid 
mutations,  it  became  apparent  that  a  minimum  of  three  CH —n 
interactions  (doubled  by  symmetry)  are  needed  to  compete 
with  non-catenated  structures.  In  this  regard,  the  CH —n 
interactions  in  the  pair  of  core  CH— n—  CH  sandwiches 
significantly  contribute  to  product  stability.  Sufficiently 
stabilizing  CH— n  interactions  were  additionally  found  to 
compensate  for  the  loss  of  the  key  cross-ring  hydrogen  bonds 
when  the  D-Pro-L-Pro  monomer  was  utilized  (Figure  5).  Each 
of  these  highlighted  examples  supports  the  notion  of  an 
interwoven  network  of  energetically  balanced  non-covalent 
interactions  that  can  adjust  their  structure  to  compensate  for 
mutational  weaknesses  created  in  the  structure. 

Most  illustrative  of  this  concept  is  the  X-ray  analysis  of  the 
catenane  F3sph  (F3dph  in  ref  3)  obtained  from  the  combination 
of  the  D-Pro-Ac4c  and  L-FPro-L-PheGly  monomers.  A 
comparative  analysis  of  this  structure  and  the  parent  H3a 
indicates  that  conversion  of  the  naphthyl  to  a  phenyl  ring  in  the 
CH— 7i—  CH  sandwich  causes  the  expected  increase43  in  the 
CH— n  distances  due  to  a  weakening  of  the  n  donor.  In 
response,  the  network  of  non-covalent  interactions  shortens  the 
adjacent  inter-ring  hydrogen  bonds,  which  concomitantly 
decreases  the  width  of  the  aryl  barrel  and  brings  the  fi-l 
loops  sufficiently  close  that  one  of  them  flips  from  a  type  IF 
turn  to  a  type  I'  to  enable  a  new  inter-ring  hydrogen  bond. 
These  compensating  responses  from  the  non-covalent  network 
help  to  recover  some  of  the  loss  in  stability  that  the  naphthyl- 
to-phenyl  conversion  would  have  otherwise  created  in  the 
CH— n—  CH  sandwich. 

The  cooperativity  of  the  non-covalent  network  can  be  further 
supported  by  a  Gibbs  free  energy  analysis.  Since  the  number  of 
potential  n-mers  is  the  same  for  each  DSA  reaction,  one  can 
semi-quantitatively  ascribe  an  energy  of  the  catenane  relative  to 
the  ensemble  of  competing  structures  by  defining  an 
equilibrium  between  all  competitors  and  the  catenanes  (eq 
l).  The  resulting  equilibrium  constant  (eq  2)  can  then  be  used 

Keq 

oligomers  (cyclic,  acyclic)  ^  [2]-catenanes  (l) 


to  provide  a  baseline  energy,  perturbations  from  which  will 
generate  a  A  AG  that  correlates  to  the  perturbation  gain  or  loss 
in  stability.44  A  comparison  of  these  perturbation  free  energy 
changes  to  theoretical  measures  of  the  expected  gain  or  loss  of 
stability  provides  an  assessment  of  the  independence  of  the 
non-covalent  interactions  in  question.  Of  particular  interest  are 
theoretical  measures  of  CH— n  interaction  strengths:  the 
computational  interaction  strength  for  the  CH4— C6H6 
complex  is  1.45  kcal/mol,  while  it  is  1.92  kcal/mol  for  the 
CH4 — C10H8  complex. 12f 


[[2]-catenanes]pn 

r  H 

eq  non-catenated  oligomers]eq 


(2) 


Using  the  methyl  deletion  experiments  outlined  in  Scheme 
2b  and  reported  in  Table  1,  one  can  predict  how  much  of  a 
drop  in  stability  to  expect  on  changing  a  naphthyl  to  a  phenyl, 
or  removing  a  CH3— n  interaction.  Shown  in  Figure  9  are  the 
relative  energies  of  these  same  changes  determined  exper¬ 
imentally  using  the  approach  in  eqs  1  and  2.  In  the  case  of  the 
two  D-Ala  scenarios,  these  concentrations  were  estimates  from 


D-Pro-D-Ala  (d-1  b)  / 
L-Pro-L-PheGly  (H2Ph) 


D-Pro-D-Ala  (d-1  b)  / 
L-Pro-L-NaphGly  (H2) 


AAG  =  >  4  33  (2.90) 


AAG  =  >  4.96  (3.84) 


|  p-Pro-Gly  (1  v)  /L-Pro-L-PheGly  (H2Ph)  | 


AAG  =  0  28 
(2.90) 


AAG  =  0.22  (0.94) 


I  p-Pro-Gly  (1  v)  /L-Pro-L-NaphGly  (H2)  | 
|i>Pro-Aib  (la)  /L-Pro-L-PheGly  (H2Ph)| 


AAG  =  0  68 
(1.88) 


AAG  =  0.74 
(3  84) 


I  p-Pro-Aib  (la)  /L-Pro-L-NaphGly  (H2) 


Figure  9.  Diagram  showing  AAG  values  (kcal/mol)  between  the  six 
DSAs.  The  values  in  parentheses  are  obtained  from  the  CH— n 
interaction  values  reported  in  the  literature. 12f  In  each  DSA,  AG  was 
obtained  from  the  concentration  ratio  of  catenanes  and  non- 
catenanted  species,  and  AAG  is  the  difference  of  two  DSAs’  AG 
values.  See  the  details  in  SI1  Table  4. 


the  detection  limit  of  the  LC-UV  traces.  Thus,  the  D-Ala  for  Aib 
substitution  corresponds  to  the  top  right  catenane  and  the 
lowest  energy  structure  (right  green  arrow  in  Figure  9).  The 
measured  minimum  energy  difference  is  ~5  kcal/mol.  Since 
each  CH4 — C10H8  interaction  is  worth  1.92  kcal/mol  and  two 
are  lost  in  the  D-Ala  for  Aib  mutation,  a  computational 
prediction  based  on  CH— n  contributions  only  would  predict  a 
AAG  of  3.84  kcal/mol.  A  similar  analysis  follows  for  the  phenyl 
series  (left  green  arrow  in  Figure  9),  except  that  two  CH4 — 
C6H6  interactions  are  worth  2.90  kcal/ mol,  while  the  minimum 
measured  energy  difference  is  ~4.3  kcal/mol.  The  loss  of 
CH— n  interactions  is  clearly  insufficient  to  describe  the 
phenomenon.  As  discussed  previously,  this  larger  difference  is 
ascribed  to  the  change  in  intrinsic  /Fturn  preference  for  these 
two  dipeptides  and  the  absolute  requirement  for  a  type  IF  turn 
at  the  /?-2  position.  Additionally,  contributions  no  doubt  come 
from  the  loss  of  the  aforementioned  inter-ring  hydrogen  bonds 
and  the  turn-on  of  the  carbonyl— carbonyl  repulsions  at  the  i+ 1 
position. 

Results  at  the  opposite  end  of  the  spectrum  were  also  seen  in 
this  data  set.  For  example,  when  the  Mel  and  Me2  positions 
(Scheme  2b)  are  each  replaced  with  hydrogens,  one  should  lose 
the  equivalent  of  two  CH4 — C10H8  interactions,  i.e.,  3.84  kcal/ 
mol  of  stability  (recall  the  /Fturn  preferences  are  the  same  for 
D-Pro-Aib  and  D-Pro-Gly).  Instead,  a  AAG  of  only  0.74  kcal/ 
mol  is  observed  (right  red  arrow  in  Figure  9).  If  one  assumes 
the  energy  difference  reflects  a  complete  loss  of  the  CH— n 
interactions,  this  measured  difference  is  most  reasonably 
interpreted  as  resulting  from  compensating  forces.  That  is, 
the  network  is  sufficiently  entwined  that  losses  in  one  part  of 
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the  web  can  be  neutralized  by  gains  in  another.  Phenyl  for 
naphthyl  changes  were  found  to  behave  similarly  (blue  arrows 
in  Figure  9). 

The  complexity  in  the  [2]-catenane  leads  to  numerous 
outcomes  on  mutagenesis.  In  some  instances,  the  deletion  of 
seemingly  important  interactions  (naphthyl  to  phenyl,  Figure 
9)  causes  little  stability  loss,  as  the  network  of  stabilizing  forces 
can  adjust  the  balance  to  compensate  for  the  loss.  In  other 
circumstances,  the  excision  of  a  contributing  interaction 
(removal  of  CH3  in  Scheme  2b)  leads  to  disproportionate 
losses  in  stability  as  larger  primary  forces  are  revealed.  The 
question  of  cooperativity  and  multi-valency  is  thus  distorted  by 
the  situation- dependent  strength  of  the  individual  non-covalent 
contributors  to  stability.  That  is,  in  a  complex  system,  the 
ground-state  structure  is  a  compromise  (at  the  individual 
interaction  level)  that  maximizes  the  global  balance  of  forces. 
When  the  balance  is  shifted  by  a  mutation,  it  is  possible  that  a 
new  minimum  can  be  achieved  through  a  rearrangement  that 
best  rebalances  the  energetic  contributors. 

The  parallels  to  biological  scenarios  are  many.  For  example,  a 
glutamine  (Gin)  for  glutamic  acid  (Glu)  mutation  at  the  /?-6 
position  of  hemoglobin  (Hb  Machida)45  causes  no  conforma¬ 
tional  or  functional  changes.  The  replacement  of  the  same 
residue  by  lysine  (Lys)  or  alanine  (Ala)  also  leads  to  no 
functional  changes  (Hb  C  and  Hb  G  Makassar).46  By  contrast, 
replacement  of  this  glutamic  acid  by  valine  (Hb  S)  causes  a 
severe  structural  change  and  a  great  diminishment  of  the 
deoxygenated  Hb  S  solubility.  The  subsequent  polymerization 
leads  to  sickle  cell  anemia 47  Thus,  several  mutations  can  be 
accommodated,  but  one  particular  mutation  leads  to  a  shift 
toward  a  different  thermodynamic  state  with  negative 
consequences.  The  catenane  mutagenesis  recapitulates  this 
phenomenon. 

This  [2] -catenane  is  an  uncommon  example  of  complex 
folding  phenomena  in  a  biomimetic  system.  Rather  than 
isolating  a  single  non-covalent  interaction,  as  is  the  case  in  most 
model  systems,  these  catenanes  provide  a  model  for  how 
multivalent  systems  can  compensate  for  unfavorable  mutations 
and  minimize  the  “toxicity”  of  many,48  but  not  all  mutations. 
This  is,  of  course,  a  crucial  molecular-level  characteristic  that  is 
required  for  evolution. 

■  CONCLUSION 

Mutation  studies  of  the  key  non-covalent  interactions  in  the 

[2] -catenanes  described  here  suggest  that  the  structures  are 
complex  enough  to  mimic  numerous  key  properties  of  large 
protein  systems.  The  manipulation  of  structure  and  concom¬ 
itant  changes  in  stability  provide  insight  into  the  interplay 
between  the  components  of  a  network  of  non-covalent 
interactions.  Even  in  a  self-assembled  complex,  the  complexity 
of  the  structure  requires  that  a  balance  of  forces  be  found,  and 
this  balance  can  fluctuate  as  structural  mutations  occur.  This 
network  of  forces  provides  the  means  to  compensate  for  non- 
optimal  changes  or  even  explicitly  reject  these  mutations  that 
create  insurmountable  energetic  costs.  These  fundamental 
properties  are  therefore  not  exclusively  found  in  large  biological 
systems. 
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